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i— H Abstract 



This note motivates a version of the generalized pythagorean that 
says: if A is an n x k matrix, then 



o 

^ det(A*A) = det(^ 7 ) 2 

where the sum is over all A; x k minors of A. This is followed by a proof 
via an elementary computation in exterior algebra. The author hopes 
it is accessible to anybody who has learned calculus with differential 
forms. 

o 

(N 

1 Introduction 

The pythagorean theorem is one of the first theorems of geometry that people 
learn. If a right triangle has legs of length a and b and its hypotenuse has 
t> length c then 

a + o = c . 



The Playfair proof of the Pythagorean theorem is easy to explain, but some- 
how mysterious. 
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b a 




a b 
Behold! 



The pythagorean theorem leads to the distance formula in the plane. If 
(xi, yi), (x2, y-i) are two points in the plane the distance between them is 
\J (x~i — X2) 2 + (yi — 1/2) 2 , which flows from the pythagorean theorem via the 
diagram below. 




If you tell someone that the Pythagorean theorem generalizes to any 
dimension, often they will think that the generalization is just the distance 
formula for points in W 1 . However, suppose that T is the tetrahedron with 
vertices at (0,0,0), (a, 0,0), (0,6,0), and (0,0, c). 
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(0,0, c) 



(a, 0,0) 




(0,6,0) 



You can imagine that the tetrahedron has three legs, which are right 
triangles that lie in the xy-plane, xz-plane, and yz-plane. The hypotenuse 
is the triangle having vertices (a, 0, 0), (0, b, 0) and (0, 0, c). The sum of the 
squares of the areas of the three legs is 



The base of the hypotenuse is y/a 2 + b 2 and its height is y Jrpp- + c 2 . The 
area of the hypotenuse is one half of its base times its height, 



Squaring this gives the sum of the squares of the areas of the legs! This 
computation is sometimes referred to as de Gua's theorem after Jean Paul 
de Gua de Malves who was an 18th century French mathematician [2]. 

The statement that this tetrahedron is right, boils down to the legs of 
the tetrahedron being the orthogonal projections of the hypotenuse into the 
coordinate hyperplanes. The shapes we have been using so far, triangles 
and tetrahedra, are examples of simplices. Simplices are well and good, but 
for the sake of discussing the pythagorean theorem in the language of linear 
algebra, parallelepipeds are better. 

Suppose that v = (a, b, c) and w = (d, e, /) are vectors in space. The 
parallelogram spanned by v and w is everything of the form sv + tw where 



1 1 1 

1 2t2, 1 22, - L i2 2 

-a b + -a c + -b c . 
AAA 




-V 'a 2 b 2 + a 2 c 2 + b 2 c 2 . 
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s,t e [0,1]. The orthogonal projection of the parallelogram into the xy- 
plane is spanned by (a, b) and (d,e). Using the standard area formula via 
determinants its area is \ae — bd\. Its orthogonal projection into the yz-plane 
is spanned by (6, c) and (e, /) and has area \bf — ce|. Finally its projection 
into the xz-plane is spanned by (a, c) and (d, f) and has area \af — cd\. The 
Pythagorean theorem says that the square of the area of the parallelogram 
in space is the sum of the squares of the areas of the projections into the 
coordinate hyperplanes. 

(ae - bdf + (bf - cef + (af - cdf 

but this is just the norm squared of the cross product (a, b, c) x (d, e, /), which 
confirms a well known formula for the area of a parallelogram in space. 

In general, the parallelepiped spanned by vectors Vi,v 2 , ■ ■ ■ ,Vk is every- 
thing of the form Yli=i ^i^i where the Aj vary over the unit interval [0, 1]. 
The pythagorean theorem says that if P is a parallelepiped in W 1 spanned 
by A;- vectors Vi,V2, ■ ■ ■ ,Vk then the square of the /c-dimensional content of 
P, is the sum of the squares of the /c-dimensional content of the orthogonal 
projections of P into the /c-dimensional coordinate hyperplanes in IR n . It is 
the point of this note to give a statement in linear algebraic terms of this 
theorem, and prove it. 

2 Content 

Let V, W be innerproduct spaces, and L : V — >■ W a linear map. We can 
restrict L to get, 

L : ker L L — > imL, 

where by ker L 1 - we mean the subspace of V made up of all vectors that are 
pependicular to the kernel of L and by imL we mean the image of L. As 
kerL- 1 and imL are subspaces of innerproduct spaces they are themselves 
innerproduct spaces. Hence we can choose orthonormal bases for them and 
represent L as a matrix with respect to those bases. The matrix Uj is 
square. 

Definition 1. The content of L, denoted c(L) is the absolute value of the 
determinant of any matrix representation kj of L with respect to orthonormal 
bases of the perpendicular to its kernel, and its image. 
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The content is not as well behaved as the determinant, for instance it 
doesn't have a sign. Also the content of the composition of two linear maps 
is not in general the product of their contents. However, 

Proposition 1. If L : V — >■ W and M : W — > U are linear maps of inner- 
product spaces and imL = ker M 1 - then c(M o L) — c(M)c(L). 

Proof. Since imL = ker M 1 - one of the orthonormal bases you use to compute 
c(L) can be chosen to coincide with one of the orthonormal bases used to 
compute c(M). If ly and rrij k are the matrices that represent L and M with 
respect to this choice, then the matrix representing MoL is the product of kj 
and rrijk- The determinant of the product is the product of the determinants. 

□ 

If L : V — > W is a linear map of innerproduct spaces then it has an 
adjoint L* : W — )■ V defined by requiring L* to solve the equation, 

< L(v),w > w =< v,L*(w) > v 

for all v G V and w G W where < , >y and < , >w denote the innerproducts 
in V and W. If we have represented L with respect to orthonormal bases 
of V and W then the matrix of L* with respect to those bases is just the 
transpose of the matrix representing L. From this it is clear that 

c(L) = c(L*). 

Using Proposition 1, and the fact that the image of L* is the perpendicular 
to the kernel of L we arrive at c(L o L*) = c(L) 2 , and using the fact that 
L** = L we have c(L* o L) — c(L) 2 . 

Proposition 2. For any linear map of innerproduct spaces L : V — > W , 

c{L*oL) = c{LoL*) = c{L) 2 . 

Suppose / : M — > N is an immersion of manifolds and iV is Riemannian. 
Choose local coordinates x 1 , x 2 , . . . , x k at p G M. The matrix 

d d 
9ij =< d M^)' d M^j) >T f( P ) N 

is the first fundamental form of the induced Riemannian metric on M and 
the associated volume form is 

\/det{g)dx x A ... A dx k . 
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You can think of y/detg(p) as the /c-dimensional content of the parallelepiped 
spanned by the vectors 

d d 

)rthonormal bas 

representing 



Choose an orthonormal basis for Tf^N and let Vi be the column vector 



dfpi^lp) 

with respect to this basis. Let A be the n x /c-matrix whose columns are the 
Vi. Notice that A 1 A = g, so 

c(A) = ^det(g). 

In general, you can think of an n x k matrix A as a list of k column 
vectors and c(A) as the /c-dimensional content of the parallelepiped spanned 
by those vectors. Choose I C {1,2, ... ,n} with /c-elements. We use |/| to 
denote the cardinality of /. By Aj I mean the k x A;- matrix made from the 
rows of A corresponding to the subset /. The parallelepiped spanned by Aj 
can be thought of as the orthogonal projection of the parallelepiped spanned 
by A into the /c-dimensional coordinate hyperplane determined by /. This 
brings us too: 

Theorem 1 (Pythagorean Theorem). Let A be an n x k-matrix, then 
det(A l A) = det^f. 

/C{l,...,n} \I\=k 

That is, the square of the content of the parallelepiped of spanned by A is equal 
to the sum of the squares of the orthogonal projections of the parallelepiped 
into the k- dimensional coordinate hyperplanes. 

We will prove this theorem after we develop some vocabulary for manip- 
ulating determinants. 



3 Exterior Algebra and the Proof 

Let V be a vector space over k. The exterior algebra A*(V) of V, is the 
associative algebra over k that is the quotient of the free unital associative 



6 



algebra over k on V modulo the relations that v A v = for every v £ V, 
where we are denoting the multiplication by a wedge [1] . It is an elementary 
fact that the relation v A v = for all v implies that v A w = —w A v for all 
v and w. 

We will be working with innerproduct spaces. Suppose that e ( i G 
{1, . . . , n} is an orthogonal basis of V. HI— i 2 , ■ ■ ■ , ik} C {1, 2, . . . , n} 
with %x < %% ■ ■ ■ < ik let 

er = e h Ae i2 ... A e ik . 

If we let e0 = 1, then the ej where J ranges over all subsets of {1,2,..., n} 
is a basis for A*(V) In fact, A*(V) is graded by \I\. We let Ai(V) be the 
subspace spanned by all e/ with \I\ = i. Notice A»(V) has dimension m. 
We make A ! „(V A ) into an innerproduct space, by declaring that the ej are an 
orthonormal basis. 

If W is any innerproduct space we denote the identity map from W to 
itself by Idw- If L : V — > W is a linear map of innerproduct spaces then for 
all i, it induces Aj(L) : Aj(V) — >• Aj(W), for all i. It is defined by letting 

L(vi A v 2 ■ ■ ■ A Vi) = L(vx) A L(t7 2 ) A ... A L{vi). 

The assignment L — > Aj(L), is functorial as 

Ai(Id v ) = Id Ai{v) , 

and 

A i (LoM) = A i (L)oA,(M). 

where L : V —¥ W and M : W — > U are any linear mappings between 
innerproduct spaces. 

Proposition 3. If we choose orthonormal bases ej for V and fi for W and 

let lij be the matrix of f with respect to these bases, if \ J\ = i then 

L(ej) = M "fi 
[j[=i 

where Mjj is the determinant of the i x i submatrix of 1^ whose rows and 
columns come respectively from the sets I and J in order. 
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Proof. Notice that L(ej) = J2i^jfi- Expanding, 

A(W = (%)A-A%)) = 

k k k 

After a lot of cancellation and reordering we arrive at: 

= Yl Yl s 9 n ( a ) II l im,Mm) I //. 

7C{l,...,n} o-e5i \m=l / 

where Si denotes the symmetric group on i letters, and sgn(a) is the sign of 
the permutation sigma. Notice that the sum over Si of the signed products 
is a classical formula for the determinant, so it reduces to 

= M "fi 
\i\=i 

where the Mjj is the determinant of the minor whose entries are indexed by 
ij with i E I and j G J.. □ 

Now suppose that dim(V) = dim(W) = k. Notice dimA k (V) = dimA k (W) = 
1, and 

A fc (L)(e { i > ... )fc }) = det(Zij)/{i,..,fc}- 

Proposition 4. If L : V — >■ W is a linear map of innerproduct spaces then 
the map Aj(L*) is equal to Aj(L)*. 

Proof. Choose orthonormal bases and for V and respectively. That 
way, if lij is the matrix of L with respect to those bases then the matrix of L* 
is Iji. Notice ej and // are orthonormal bases of Aj(V) and Aj(W) where we 
let / and J range over subsets with % elements in their respective index sets. 
Hence if Cu is the matrix of Aj(L) with respect to the bases ej and // then 
the matrix of Aj(L*) is Here I am using the fact that the determinant 
of the transpose of a matrix is equal to the determinant of the matrix in 
computing the coefficients of the matrix for Aj(L*). □ 

We are ready. 
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Proof. Theorem [T] Let A be an n x k matrix, which we can view as the 
matrix of a linear map A : M. h — > R n with respect to the standard orthonormal 
bases. The determinant of A 1 A is expressed as 

A k (A* o A)(e {lr „ >k} ) = det(A t A)e {lj ,.. M . 

By the functoriality of the left hand side is 

A fc (^*)oA*(^)(e {w} ). 

However, we have 

Afc(^)(e { i,..., fc} ) = J2 det(A T )ei, 

/C{l,...n} |/|=fc} 

where det(Aj) is the determinant of the k x fc-matrix whose rows correspond 
to the rows of A whose index is in I. Recall, A k {A*) = A k (A)*. Going back 
to the definition of adjoint we arrive at 

A fc (A*)(ej) = det(A / )e {1) ... ifc} . 

Therefore, 

A k {A*) o A fc (.A)(e {1 ,.„ >fe} ) = A k {A*){ det(Aj) ei ) = 

/C{l,...n} |/|=fe} 

det(A / )A fe (A*)(e / ) = 

IC{l,...n} \I\=k} 

det^) 2 e { i,..., fc} . 

\/C{l,..n} |/|=fc} / 

Putting it all together we have, 

detiA'A) = det{Aj) 2 . 

/C{l,...n} \I\=k] 

□ 
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4 Epilogue 



This is a theorem that gets rediscovered over and over again. Via a web search 
I found a note by Alvarez [3] that proves the theorem for right n-simplices, 
but better than that has a nice bibliography with some references to proofs 
and historical texts. He cites evidence that the theorem first appeared in 
a book on analytic geometry by Monge and Hatchet written in the 19th 
century. There is also a paper by Atzema [I] that proves the same theorem I 
proved here, via a more general result about the determinant of a product of 
matrices due to Cauchy. I would be surprised if a proof along the lines that 
I gave here, didn't appear elsewhere. 

I think the theorem might be of pedagogical interest as it gives a unified 
paradigm for integrals to compute arc length and area that could leave the 
students in a position to set up integrals to compute higher dimensional 
content. 
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